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Confidential -- Property of Incyte Corporation SeqServer Version 4.6 Jan 2002 

Program: blastp 
Sequence ID(s): 

□ 4325626CD1 vs. genpept!36 



NCBI -BLASTP 2.0.10 [Aug-26-1999] 




Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= 4325626CD1 

(549 letters) 

Database: genpeptl36 

1,453,555 sequences; 449,659,834 total letters 

Searching done 



Score E 

Sequences producing significant alignments: (bits) Value 



El 


g23452665 Rab GTPase-activating protein PRC 17 [Homo sapiens] 


1164 


0.0 


El 


gl2 053225 hypothetical protein [Homo sapiens] 


1160 


0.0 


El 


g37330 oncogene [Homo sapiens] 


855 


0.0 


EI 


g24474450 ubiquitin-specif ic protease USP6 [Homo sapiens] 


854 


0.0 


El 


g37333 oncogene [Homo sapiens] 


602 


e-171 


El 


g37 334 oncogene [Homo sapiens] 


257 


4e-67 


E 


g29881565 Expressed sequence AI316785 [Mus musculus] 


250 


9e-65 


Ei 


g27695568 related to the N terminus of tre [Homo sapiens] 


245 


2e-63 


El 


g2104571 KIAA0019 protein [Homo sapiens] 


245 


2e-63 


El 


g28703740 Hypothetical protein MGC51025 [Homo sapiens] 


235 


2e-60 



> g23452665 Rab GTPase-activating protein PRC 17 [Homo sapiens] 
Length = 549 



Score = 1164 bits (2978), Expect = 0.0 

Identities = 548/549 (99%), Positives = 548/549 (99%) 

Query: 1 MDWEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL 60 
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MDWEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL 
Sb j c t : 1 MDWEVAGSWWAQEREDI IMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL 6 0 

Query: 61 TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEM 120 

TAREAKQI RREI S RKSKWVDMLGDWEKYKS SRKL I DRAYKGMPMNI RGPMWS VLLN EEM 
Sbjct: 61 TAREAKQI RREI S RKSKWVDMLGDWEKYKS SRKL I DRAYKGMPMNI RGPMWSVLLNI EEM 120 

Query: 121 KLKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEY 180 

KLKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEY 
Sbjct: 121 KLKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEY 180 

Query: 181 NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQE 240 

NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQE 
Sbjct: 181 NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQE 240 

Query: 241 HWATSQPKTMGHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPI 300 

HWATSQPKTMGHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPI 
Sbjct: 241 HWATSQPKTMGHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPI 300 

Query: 301 TRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLTRKQGDLPPP 360 

TRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLTRKQGDLPPP 
Sbjct: 301 TRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLTRKQGDLPPP 360 

Query: 361 AKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGA 420 

AKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGA 
Sbjct: 3 61 AKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGA 420 

Query: 421 VREDTYPVGTQGVPSPALAQGGPQGSWRFLQWNSMPRLPTDLDVEGPWFRHYDFRQSCWV 480 

VREDTYPVGTQGVPSPALAQGGPQGSWRFLQWNSMPRLPTDLDVEGPWFRHYDFRQSCWV 
Sbjct: 421 VREDTYPVGTQGVPSPALAQGGPQGSWRFLQWNSMPRLPTDLDVEGPWFRHYDFRQSCWV 480 

Query: 481 RAISQEDQLAPCWQAEHPAERVRSAFAAPSTDSDQGTPFRARDEQPCAPTSGPCLCGLHL 540 

RAISQEDQLAPCWQAEHPAERVRSAFAAPSTDSDQGTPFRARDEQPCAPTSGPCLCGLHL 
Sbjct: 481 RAISQEDQLAPCWQAEHPAERVRSAFAAPSTDSDQGTPFRARDEQPCAPTSGPCLCGLHL 540 

Query: 541 ESSQFPPGF 549 

ESSQFPPGF 
Sbjct: 541 ESSQFPPGF 549 



> g!2053225 hypothetical protein [Homo sapiens] 
Length = 549 

Score = 1160 bits (2969), Expect = 0.0 

Identities = 546/549 (99%), Positives = 548/549 (99%) 

Query: 1 MDWEVAGSWWAQEREDI IMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL 60 

MDWEVAGSWWAQEREDI IMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL 
Sbjct: 1 MDWEVAGSWWAQEREDI IMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL 60 

Query: 61 TAREAKQI RRE I SRKSKWVDMLGDWEKYKS SRKLI DRAYKGMPMNI RGPMWSVLLNTEEM 120 

TAREAKQI RRE I SRKSKWVDMLGDWEKYKSSRKLI DRAYKGMPMNI RGPMWSVLLNTEEM 
Sbjct: 61 TAREAKQI RRE I SRKSKWVDMLGDWEKYKS SRKL I DRAYKGMPMNI RGPMWSVLLNTEEM 120 

Query: 121 KLKNPGRYQ IMKEKGKRS SEH I QR I DRDVSGTLRKH I FFRDRYGTKQRELLH I LLAYEE Y 180 

KLKNPGRYQIMKEKGK+ S SEH I QR I DRDVSGTLRKH I FFRDRYGTKQRELLH I LLAYEE Y 
Sbjct: 121 KLKNPGRYQIMKEKGKKS SEH I QRIDRDVSGTLRKHI FFRDRYGTKQRELLH I LLAYEEY 180 

Query: 181 NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQE 240 

NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQE 
Sbjct: 181 NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQE 240 

Query: 241 HWATSQPKTMGHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPI 300 

HWATSQPKTMGHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPI 
Sbjct: 241 HWATSQPKTMGHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPI 300 
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Query: 301 TRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVL.KHLRASMKKLTRKQGDLPPP 3 60 

TRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLTRK+GDLPPP 
Sbjct: 301 TRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLTRKKGDLPPP 360 

Query: 361 AKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGA 420 

AKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGA 
Sbjct: 361 AKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGA 420 

Query: 421 VREDT YPVGTQGVPSPALAQGGPQGSWRFLQWNSMPRLPTDLDVEGPWFRHYDFRQSCWV 480 

VREDTYPVGTQGVPSPALAQGGPQGSWRFLQWNSMPRLPTDLDVEGPWFRHYDFRQSCWV 
Sbjct: 421 WEDTYPVGTQGVPSPALAQGGPQGSWRFLQWNSMPRLPTDLDVEGPWFRHYDFRQSCWV 480 

Query: 481 RAISQEDQLAPCWQAEHPAERVRSAFAAPSTDSDQGTPFRARDEQPCAPTSGPCLCGLHL 540 

RAISQEDQLAPCWQAEHPAERVRSAFAAPSTDSDQGTPFRARDEQ CAPTSGPCLCGLHL 
Sbjct: 481 RAISQEDQLAPCWQAEHPAERVRSAFAAPSTDSDQGTPFRARDEQQCAPTSGPCLCGLHL 540 

Query: 541 ESSQFPPGF 549 

ESSQFPPGF 
Sbjct: 541 ESSQFPPGF 549 



> g37330 oncogene [Homo sapiens] 
Length = 786 



Score = 855 bits (2185), Expect = 0.0 

Identities = 407/500 (81%), Positives = 440/500 (87%), Gaps = 1/500 (0%) 

Query: 1 MDWEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNlSnWDHLGIVHETELPPL 60 

MD+VE A S AQER+DI+MKY+KGHRAGLPEDKGP+P N+++D GI+HETELPP+ 
Sbjct : 1 MDMVENADSLQAQERKDILMKYDKGHRAGLPEDKGPEPV-GINSSIDRFGILHETELPPV 59 

Query: 61 TAREAKQI RRE I SRKS KWVDMLGDWEKYKS SRKL I DRAYKGMPMNI RGPMWSVLLNTEEM 120 

TAREAK+ 1 RRE + + R SKW+ +MLG+WE YK S KLIDR YKG+ PMNIRGP+WSVLLN +E+ 
Sbjct: 60 TAREAKKIRREMTRTSKWMEMLGEWETYKHSSKLIDRWKGIPMNIRGPVWSVLLNIQEI 119 

Query: 121 KLKNPGRYQ IMKEKGKRS S EH I QRI DRDVSGTLRKH I FFRDR YGTKQRELLH ILLAYEE Y 180 

KLKNPGRYQIMKE+GKRSSEHI ID DV TLR H+FFRDRYG KQREL + ILL AY EY 
Sbjct: 120 KLKNPGRYQ IMKERGKRSS EH IHH I DLDVRTTLRNHVFFRDRYGAKQRELFYILLAYSEY 179 

Query: 181 NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQE 240 

NPEVG YC RDL SHI ALFLLYLPEEDAFWALVQLLASERHSL GFHS PNGGTVQGLQDQQE 
Sbjct: 180 NPEVGYCRDLSHITALFLLYLPEEDAFWALVQLLASERHSLPGFHS PNGGTVQGLQDQQE 239 

Query: 241 HWATSQPKTMGHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPI 300 

HW SQPKTM HQDK+ LCGQC+ LGCL+R LIDGISLGLTLRLWDVYLVEGEQ LMPI 
Sbjct: 240 HWPKSQPKTMWHQDKEGLCGQCASLGCLLRNLIDGISLGLTLRLWDVYLVEGEQVLMPI 299 

Query: 301 TRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLTRKQGDLPPP 360 

T IA KVQQKRL KTSRCG WAR N+F DTWA ++DTVLKHLRAS KKLTRKQGDLPPP 
Sbjct: 300 TSIALKVQQKRLMKTSRCGLWARLRNQFFDTWAMNDDTVLKHLRASTKKLTRKQGDLPPP 359 

Query: 361 AKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGA 420 

AK EQGS A RPVPASRGGKTLCKG RQAPPGPPA+F RPI SASPP A R STPCPGGA 
Sbjct: 360 AKREQGSLAPRPVPASRGGKTLCKGYRQAPPGPPAQFQRPICSASPPWASRFSTPCPGGA 419 

Query: 421 VREDTYPVGTQGVPSPALAQGGPQGSWRFLQWNSMPRLPTDLDVEGPWFRHYDFRQSCWV 480 

VREDT Y PVGTQGV P S ALAQGGPQGSWRFL+W SMPRLPTDLD+ GPWF HYDF + SCWV 
Sbjct: 420 VREDT Y PVGTQGV PSLALAQGGPQGSWRFLEWKSMPRLPTDLD I GGPWFPHYDFERSCWV 479 



Query: 481 RAISQEDQLAPCWQAEHPAE 500 

RAISQEDQLA CWQAEH E 
Sbjct: 480 RAISQEDQLATCWQAEHCGE 499 
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> g24474450 ubiquitin-specif ic protease USPS [Homo sapiens] 
Length = 1406 

Score = 854 bits (2182), Expect = 0.0 

Identities = 407/500 (81%), Positives = 439/500 (87%), Gaps = 1/500 (0%) 

Query: 1 MDWEVAGSVWAQEREDIIMKYEKGHRAGL^ 60 

MD+VE A S AQER+DI+MKY+KGHRAGLPEDKGP+P N+++D GI+HETELPP+ 
Sbjct : 1 MDMVENADSLQAQERKDILMKYDKGHRAGLPEDKGPEPV-GINSSIDRFGILHETELPPV 59 

Query: 61 TAREAKQI RRE I SRKSKWVDMLGDWEKYKS SRKL I DRAYKGMPMNI RGPMWSVLLNTEEM 120 

TAREAK+IRRE++R SKW++MLG+WE YK S KLIDR YKG+PMNIRGP+WSVLLN + E+ 
Sbjct: 60 TAREAKKIRREMTRTSKV^EMLGEWETYKHSSKLIDRWKGIPMNIRGPVWSVLLNIQEI 119 

Query: 121 KLKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEY 180 

KLKNPGRYQIMKE+GKRSSEHI ID DV TLR H+FFRDRYG KQREL +ILLAY EY 
Sbjct: 120 KLKNPGRYQ IMKERGKRS SEH I HH IDLDVRTTLRNHVFFRDRYGAKQRELF YI LLAYS E Y 179 

Query: 181 NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQE 240 

NPEVGYCRDLSHI ALFLLYLPEEDAFWALVQLLASERHSL GFHS PNGGTVQGLQDQQE 
Sbjct: 180 NPEVGYCRDLSH I TALFLLYLPEEDAFWALVQLLASERHSLPGFHS PNGGTVQGLQDQQE 239 

Query: 241 HWATSQPKTMGHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPI 300 

HW SQPKTM HQDK+ LCGQC+ LGCL+R LIDGISLGLTLRLWDVYLVEGEQ LMPI 
Sbjct: 240 HWPKSQPKTMWHQDKEGLCGQCASLGCLLRNLIDGISLGLTLRLWDVYLVEGEQVLMPI 299 

Query: 301 TRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVliKHLRASMKKLTRKQGDLPPP 360 

T IA KVQQKRL KTSRCG WAR N+F DTWA ++DTVLKHLRAS KKLTRKQGDLPPP 
Sbjct: 300 TSIALKVQQKRLMKTSRCGLWARLRNQFFDTWAMlSroDTVTjKHLRASTKKL 359 

Query: 361 AKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGA 420 

AK EQGS A RPVPASRGGKTLCKG RQAPPGPPA+F RPI SASPP A R STPCPGGA 
Sbjct: 360 AKREQGSLAPRPVPASRGGKTLCKGYRQAPPGPPAQFQRPICSASPPWASRFSTPCPGGA 419 

Query: 421 VREDTYPVGTQGVPSPALAQGGPQGSWRFLQWNSMPRLPTDLDVEGPWFRHYDFRQSCWV 480 

VREDTYPVGTQGVPS ALAQGGPQGSWRFL+W SMPRLPTDLD+ GPWF HYDF SCWV 
Sbjct: 42 0 VREDTYPVGTQGVPSLALAQGGPQGSWRFLEWKSMPRLPTDLDIGGPWFPHYDFEWSCWV 479 

Query: 481 RAISQEDQLAPCWQAEHPAE 500 

RAISQEDQLA CWQAEH E 
Sbjct: 480 RAI SQEDQLATCWQAEHCGE 499 



> g37333 oncogene [Homo sapiens] 
Length = 376 

Score = 602 bits (1536), Expect = e-171 

Identities = 289/366 (78%), Positives = 318/366 (85%), Gaps = 1/366 (0%) 

Query: 1 MDWEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL 60 

MD+VE A S AQER+DI+MKY+KGHRAGLPEDKGP+P N+++D GI+HETELPP+ 
Sbjct: 1 MDMVENADSLQAQERKDILMKYDKGHRAGLPEDKGPEPV-GINSSIDRFGILHETELPPV 59 

Query: 61 TAREAKQI RREI SRKSKWVDMLGDWEKYKS SRKL I DRAYKGMPMNI RGPMWSVLLNTEEM 120 

TAREAK+ 1 RRE+ + R SKW+ +MLG+WE YK S KLIDR YKG+PMNIRGP+WSVLLN +E+ 
Sbjct: 60 T AREAKKI RREMTRTS KWMEMLGEWETYKHS SKL I DRVYKGI PMNI RGPVWSVLLNIQE I 119 

Query: 121 KLKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEY 180 

KLKNPGRYQIMKE+GKRSSEHI ID DV TLR H+FFRDRYG KQREL +ILLAY EY 
Sbjct: 120 KLKNPGRYQIMKERGKRSSEHIHHIDLDVRTTLRNHVFFRDRYGAKQRELFYILLAYSEY 17 9 

Query: 181 NPEVGYCRDLSHI AALFLLYLPEEDAFWALVQLLASERHSLQGFHS PNGGTVQGLQDQQE 240 

NPEVGYCRDLSHI ALFLLYLPEEDAFWALVQLLASERHSL GFHS PNGGTVQGLQDQQE 
Sbjct: 180 NPEVGYCRDLSHI TALFLLYLPEEDAFWALVQLLASERHSLPGFHS PNGGTVQGLQDQQE 239 
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Query: 241 HWATSQPKTMGHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPI 300 

HW SQPKTM HQDK+ LCGQC+ LGCL+R L IDG I S LGLTLRLWDVYLVEGEQ LMPI 
Sbjct: 240 HWPKSQPKTMWHQDKEGLCGQCASLGCLLRNLIDGISLGLTLRLWDVYLVEGEQVLMPI 299 

Query: 301 TRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLTRKQGDLPPP 360 

T I A KVQQKRL KTSRCG WAR N+F DTWA ++DTVLKHLRAS KKLTRKQGDLPPP 
Sbjct: 300 TSIALWQQKRLMKTSRCGLWARLRNQFFDTWAMNDDTVLKHLRASTKKLTRKQGDLPPP 359 

Query: 361 AKPEQG 366 
G 

Sbjct: 360 GPTALG 365 



> g37334 oncogene [Homo sapiens] 
Length = 1089 

Score = 257 bits (651), Expect = 4e-67 

Identities = 121/149 (81%), Positives = 125/149 (83%) 

Query: 352 RKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPR 411 

R D AK EQGS A RPVPASRGGKTLCKG RQAPPGPPA+F RPI SASPP A R 
Sbjct: 34 RDPKDSRDAAKREQGSLAPRPVPASRGGKTLCKGYRQAPPGPPAQFQRPICSASPPWASR 93 

Query: 412 S STPC PGGAVREDTYPVGTQGVPS PALAQGGPQGSWRFLQWNSMPRLPTDLDVEGPWFRH 471 

STPC PGGAVREDTYPVGTQGVPS ALAQGGPQGSWRFL+W SMPRLPTDLD+ GPWF H 
Sbjct : 94 FSTPCPGGAVREDTYPVGTQGVPSLALAQGGPQGSWRFLEWKSMPRLPTDLDIGGPWFPH 153 

Query: 472 YDFRQSCWVRAISQEDQLAPCWQAEHPAE 500 

YDF +SCWVRAISQEDQLA CWQAEH E 
Sbjct: 154 YDFERSCWVRAI SQEDQLATCWQAEHCGE 182 



> g29881565 Expressed sequence AI316785 [Mus musculus] 
Length = 707 

Score = 250 bits (631), Expect = 9e-65 

Identities = 151/445 (33%), Positives = 223/445 (49%), Gaps = 18/445 (4%) 

Query : 12 AQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNlSrVDHLGIVHETELPPLTAREAKQIRRE 7 1 

AQER +1+ KY++G E + Y D G +HE ELP A +Q + E 

Sbjct: 35 AQERAEIVAKYDRGREGAEIEPWEDADYLVYKVT-DRFGFLHEEELPYHNAAADRQKQLE 93 

Query: 72 ISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEMKLKNPGRYQIM 131 

I R SKW+ ML WE+YK++ K R YKG+P+ +RG +W++LL +MK + Y + 
Sbjct: 94 IERTSKV^KMLKKWERYKNTEKFHRRIYKGIPLQLRGEVWALLLEIPKMKEETRDLYSKL 153 

Query: 132 KEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPEVGYCRDLS 191 

K + + S I++ID DV+ T R HI FRDRYG KQ+ L H+L AY YN EVGYC+ + S 
Sbjct: 154 KHRARGCSPDIRQIDLDVNRTFRDHIMFRDRYGVKQQSLFHVLAAYSIYNTEVGYCQGMS 213 

Query: 192 H I AALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQEHWAT SQPKTM 251 

I AL L+Y+ EEDAFWALV+L + +H++ GF + Q+ E ++ K 

Sbjct: 214 QITALLLMYMNEEDAFWALVKLFSGPKHAMHGFFVQGFPKLLRFQEHHEKILNKFLSKLK 273 

Query: 252 GHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKR 311 

H D +++ + + +D L LR+WD+Y+ EGE+ L ++ K+ +K 

Sbjct: 274 QHLDSQEIYTSFYTMKWFFQCFLDRTPFRLlsn^RIWDIYIFEGERVLTAMSYTILKLHKKH 333 

Query: 312 LTKTSRCGPWARFCNRFVDTWARD EDTVLKHLRASMKKLTRKQGDLPPPAKPEQ- - 365 

L K S +T A+D +D V++ L+ SM +L R + DLP P K ++ 

Sbjct: 334 LMKLS MEELVEFLQETLAKDFFFEDDFVIEQLQVSMAELKRAKLDLPEPGKEDEYP 389 

Query: 366 GSSASRPVPASRGGKTLCKGDRQ-APPGPPAR FPRPIWSASPPRAPRSSTPCPG 418 

+ P S L G R P P PR SP R+ TP 

Sbjct: 390 KKPLGQLPPESACVNHLSNGQRSVGRPSPKTSSRREDGSPRKNHEHSPVHHSRNGTPERA 449 
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Query: 419 GAVREDTYPVGTQGVPSPALAQGGP 443 

G R + G+ + + A +Q P 
Sbjct: 450 GQSRRKSVDEGSKNLKHEAESQRKP 474 



> g27695568 related to the N terminus of tre [Homo sapiens] 
Length = 828 

Score = 245 bits (620), Expect = 2e-63 

Identities = 148/434 (34%), Positives = 215/434 (49%), Gaps = 25/434 (5%) 

Query: 12 AQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAREAKQIRRE 71 

AQER +1+ KY++G E + Y D G +HE ELP +Q E 

Sbjct: 12 AQERAEIVAKYDRGREGAEIEPWEDADYLVYKVT-DRFGFLHEEELPDHNVAVERQKHLE 70 

Query: 72 ISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVIiLNTEEMKLKNPGRYQIM 131 

I R +KW+ ML WEKYK++ K R YKG+P+ +RG +W++LL +MK + Y + 
Sbjct: 71 IERTTKWLKMLKGWEKYKOTEKFHRRIYKGIPLQLRGEVWALLLEIPKMKEETRDLYSKL 130 

Query: 132 KEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPEVGYCRDLS 191 

K + + S I++ID DV+ T R HI FRDRYG KQ+ L H+L AY YN EVGYC+ +S 
Sbjct: 131 KHRARGCSPDIRQIDLDVNRTFRDHIMFRDRYGVKQQSLFHVLAAYSIYNTEVGYCQGMS 190 

Query: 192 HIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQEHWATSQPKTM 251 

I AL L+Y+ EEDAFWALV+L + +H+ + GF + Q+ E ++ K 

Sbjct: 191 QITALLLMYMNEEDAFWALVKLFSGPKHAMHGFFVQGFPKLLRFQEHHEKILNKFLSKLK 250 

Query: 252 GHQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKR 311 

H D +++ + + +D L LR+WD+Y+ EGE+ L + + K+ +K 

Sbjct: 251 QHLDSQEIYTSFYTMKWFFQCFLDRTPFTLNLRIWDIYIFEGERVLTAMSYTILKLHKKH 310 

Query: 312 LTKTSRCGPWARFCNRFVDTWARD EDTVLKHLRASMKKLTRKQGDLPPPAK P 363 

L K S F +T A+D +D V+ + L+ SM +L R + DLP P K P 

Sbjct: 311 LMKLS MEELVEFFQETLAKDFFFEDDFVIEQLQISMTELKRAKLDLPEPGKEDEYP 366 

Query: 3 64 EQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGAVRE 423 

++ PSGLGR GP SP+R + P 

Sbjct: 367 KKPLGQLPPELQSWGVHHLSNGQRSV--GRP SPLASGRRESGAPHRRHEH 414 

Query: 424 DTYPVGTQGVPSPA 437 

+P G P A 
Sbjct: 415 SPHPQSRTGTPERA 42 8 



> g2104571 KIAA0019 protein [Homo sapiens] 
Length =828 

Score = 245 bits (620), Expect = 2e-63 

Identities = 148/434 (34%), Positives = 215/434 (49%), Gaps = 25/434 (5%) 

Query: 12 AQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAREAKQIRRE 71 

AQER +1+ KY++G E + Y D G +HE ELP +Q E 

Sbjct: 12 AQERAE I VAKYDRGREGAE I EPWEDADYLVYKVT - DRFGFLHEEELPDHNVAVERQKHLE 70 

Query: 72 ISRKSKW\n3MLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEMKLKNPGRYQIM 131 

I R +KW+ ML WEKYK++ K R YKG+P+ +RG +W++LL +MK + Y + 
Sbjct: 71 IERTTKWLKMLKGWEKYKNTEKFHRRIYKGIPLQLRGEWALLLEIPKMKEETRDLYSKL 130 

Query: 132 KEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPEVGYCRDLS 191 

K + + S I + + ID DV+ T R HI FRDRYG KQ+ L H+L AY YN EVGYC+ +S 
Sbjct: 131 KHRARGCSPDIRQIDLDVNRTFRDHIMFRDRYGVKQQSLFHVLAAYSIYNTEVGYCQGMS 190 

Query: 192 HIAALFLLYLPEEDAFWALVQLLASERHSLQGFHSPNGGTVQGLQDQQEHWATSQPKTM 251 
I AL L+Y+ EEDAFWALV+L + +H++ GF + Q+ E ++ K 
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BLAST2 Results 



http://patents.incyte.com: 8000/cgi-bin/SeqServer/SeqServi 



Sbjct : 


191 


Query: 


252 


Sbjct: 


251 


Query: 


312 


Sbjct: 


311 


Query: 


364 


Sbjct : 


367 


Query: 


424 


Sbjct: 


415 



191 QITALLLMYMNEEDAFWALVKLFSGPKHAMHGFFVQGFPKLLRFQEHHEKILNKFLSKLK 250 

HQDKKDLCGQCSPLGCLIRILIDGISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKR 311 
H D + + + + + +D L LR+WD+Y+ EGE+ L ++ K+ +K 



LTKTSRCGPWARFCNRFVDTWARD EDTVLKHLRASMKKLTRKQGDLPPPAK P 363 

L K S F +T A+D +D V++ L+ SM +L R + DLP P K P 

LMKLS MEELVEFFQETLAKDFFFEDDFVIEQLQISMTELKRAKLDLPEPGKEDEYP 366 



SGLGR GP SP+R + P 

)SWGVHHLSNGQRSV- -GRP SPLASGRRESGAPHRRHEH 414 



+P G P 



> g28703740 Hypothetical protein MGC51025 [Homo sapiens] 
Length = 250 

Score = 235 bits (594), Expect = 2e-60 

Identities = 120/206 (58%), Positives = 146/206 (70%), Gaps = 5/206 (2%) 

Query: 18 IIMKYEKGHRAGLPEDKGPKPF--RSYNNNVDHLGIVHETELPPLTAREAKQIRREISRK 75 

II KYE+GHRAG D G + R Y NN LGIVHE ELP ++A E KQ R+E R 
Sbjct: 19 IITKYEQGHRAGAAVDLGHEQVDVRKYTNN LGIVHEMELPRVSALEVKQRRKESKRT 75 

Query: 76 SKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEMKLKNPGRYQIMKEKG 135 

+KW ML DW KY+S++KL R YK +P+ +RG WS+LL+ + +K +NPG+ Y+ +MKEKG 
Sbjct: 76 NKWQKMLADWTKYRSTKKLSQRWKVIPLAVRGRAWSLLLDIDRIKSQNPGKYKVMKEKG 135 

Query: 136 KRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPEVGYCRDLSHIAA 195 

KRSS I I DVS TL+KH+ F R+G KQ+EL IL+AY YNPEVGY RDLS I A 
Sbjct: 136 KRSSRI IHC IQLDVSHTLQKHMMFIQRFGVKQQELCDILVAYSAYNPEVGYHRDLSRITA 195 

Query: 196 LFLLYLPEEDAFWALVQLLASERHSL 221 

+ LL LPEED FWAL QLLA ERHSL 
Sbjct: 196 I LLLCL PEEDVFWALTQLLAGERHSL 221 



Database : genpept 13 6 

Posted date: Jul 31, 2003 9:27 AM 
Number of letters in database: 449,659,834 
Number of sequences in database: 1,453,555 

Lambda K H 

0.319 0.136 0.436 

Gapped 

Lambda K H 

0.270 0.0470 0.230 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 550686865 

Number of Sequences: 1453555 

Number of extensions: 25993291 

Number of successful extensions: 86626 

Number of sequences better than 10.0: 846 

Number of HSP's better than 10.0 without gapping: 199 

Number of HSP's successfully gapped in prelim test: 682 

Number of HSP's that attempted gapping in prelim test: 82213 
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BLAST2 Results 



http://patents.incyte.com:8000/cgi-bin/SeqServer/SeqServi 



Number of HSP's gapped (non-prelim) : 2853 

length of query: 549 

length of database: 449,659,834 

effective HSP length: 54 

effective length of query: 495 

effective length of database: 371,167,864 

effective search space: 183728092680 

effective search space used: 183728092680 

T: 11 



A: 
XI 
X2 
X3 
SI 



40 
16 
38 
64 
41 



( 7.4 bits) 
(14.8 bits) 
(24.9 bits) 
(21.7 bits) 



Submit sequences to;| BLAST 2 



HI 



Submi t 



Reset 



IrteyteGenomlcs 
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